Visual analytics of large multidimensional data using variable binned scatter plots
نویسندگان
چکیده
The scatter plot is a well-known method of visualizing pairs of two-dimensional continuous variables. Multidimensional data can be depicted in a scatter plot matrix. They are intuitive and easy-to-use, but often have a high degree of overlap which may occlude a significant portion of data. In this paper, we propose variable binned scatter plots to allow the visualization of large amounts of data without overlapping. The basic idea is to use a non-uniform (variable) binning of the x and y dimensions and plots all the data points that fall within each bin into corresponding squares. Further, we map a third attribute to color for visualizing data distribution and clustering. Analysts are able to interact with individual data points for record level information. We have applied these techniques to solve real-world problems on credit card fraud and data center energy consumption to visualize their data distribution and cause-effect among multiple attributes. A comparison of our methods with two recent well-known variants of scatter plots is included.
منابع مشابه
Variable binned scatter plots
The scatter plot is a well-known method of visualizing pairs of two continuous variables. Scatter plots are intuitive and easy-to-use, but often have a high degree of overlap which may occlude a significant portion of the data. To analyze a dense non-uniform dataset, a recursive drill-down is required for detailed analysis. In this paper, we propose variable binned scatter plots to allow the vi...
متن کاملAnalyzing Chromatin Using Tiled Binned Scatterplot Matrices
Background: Over the last years, more and more biological data became available. Besides the pure amount of new data, also its dimensionality—the number of di erent attributes per data point—increased. Recently, especially the amount of data on chromatin and its modifications increased considerably. In the field of epigenetics, appropriate visualization tools designed for highlighting the di er...
متن کاملOrdering Categorical Data to Improve
| Visualization provides a means for exploratory analysis of large scale, complex data. In domains such as network management, these data often have categorical attributes, such as host names and event types. Unfortunately, large scale visualiza-tions of categorical data are diicult to construct since categorical values have no inherent order. We consider two visual tasks: nding groups of simil...
متن کاملMultidimensional data visualization.
Historically, data visualization has been limited primarily to two dimensions (e.g., histograms or scatter plots). Available software packages (e.g., Data Desk 6.1, MatLab 6.1, SAS-JMP 4.04, SPSS 10.0) are capable of producing three-dimensional scatter plots with (varying degrees of) user interactivity. We constructed our own data visualization application with the Visualization Toolkit (Schroe...
متن کاملDrawing Georeferenced Graphs - Combining Graph Drawing and Geographic Data
DATA VISUALIZATION Full Papers A Linear Time Algorithm for Visualizing Knotted Structures in 3 Pages Vitaliy Kurlin Supporting Event-based Geospatial Anomaly Detection with Geovisual Analytics Orland Hoeber and Monjitr Ul Hasan The Stor-e-Motion Visualization for Topic Evolution Tracking in Text Data Streams Andreas Weiler, Michael Grossniklaus and Marc H. Scholl The Visual Exploration of Aggre...
متن کامل